National Repository of Grey Literature 7 records found  Search took 0.01 seconds. 
Models and statistical analysis of record processes
Tůmová, Alena ; Volf, Petr (advisor) ; Hlubinka, Daniel (referee)
In this work we model the historical development of best performances in men's 100, 200, 400 and 800m running events. We suppose that the years best performances are independent random variables with generalized extreme value distribution for minima and that there is a decreasing trend in location. Parameters of the models are estimated by using maximum likelihood techniques. The data of years best performances are missing for some years, we treat them as right censored data that are censored by value of world record valid at that time. Graphic tools used for models diagnostics are adjusted to the censoring. The models we get are used to estimate the ultimate records and to predict new records in next years. At the end of the work we estimate several models describing historical development of years best performances for more events at one time.
Analysis of Missing Data: Comparing Performance of Traditional Methods across Mechanisms
Petrúšek, Ivan ; Soukup, Petr (advisor) ; Hendl, Jan (referee)
The objective of this thesis is to evaluate different methods of dealing with missing values in data analysis. The thesis is divided into three major chapters. The first chapter summarizes the theoretical literature on missing data and focuses on missing data mechanisms in particular. The second chapter introduces traditional methods for addressing missing data in sociological research. The third chapter assesses the performance of these methods by analyzing simulated data sets for two variables (income, IQ). For practical analysis (chapter 3), we simulated missing data according to three different mechanisms (MCAR, MAR, NMAR) and varied the proportion of missing values under these mechanisms (10%, 20%, 30%). Then, we applied each of the following four methods of addressing missing values: complete-case analysis, arithmetic mean imputation, regression imputation, and stochastic regression imputation. In order to evaluate the performance of each of these methods we performed correlation and regression analyses for each experimental condition. The results of these simulations are largely in agreement with existing theoretical literature on the subject of missing data. In the case of NMAR, all solution methods provided biased parameter estimates. In the case of MCAR, only complete-case analysis and...
Imputation of missing values in clinical data
BIRKLBAUER, Micha Johannes
Imputation of missing data is a crucial step in data analysis since many statistical methods require complete datasets. In that regard MissForest imputation is a powerful tool that seems to outperform most other imputation approaches. This analysis evaluates how good imputation using MissForest is compared to other methods like imputation by Multivariate Imputation by Chained Equations (MICE), Restricted Boltzmann Machines (RBM) or the standard strawman (mean) imputation in a clinical dataset that is used to predict the mortality of patients after heart valve surgery.
Statistical Methods for Regression Models With Missing Data
Nekvinda, Matěj ; Kulich, Michal (advisor) ; Omelka, Marek (referee)
The aim of this thesis is to describe and further develop estimation strategies for data obtained by stratified sampling. Estimation of the mean and linear regression model are discussed. The possible inclusion of auxiliary variables in the estimation is exam- ined. The auxiliary variables can be transformed rather than used in their original form. A transformation minimizing the asymptotic variance of the resulting estimator is pro- vided. The estimator using an approach from this thesis is compared to the doubly robust estimator and shown to be asymptotically equivalent.
Analysis of Missing Data: Comparing Performance of Traditional Methods across Mechanisms
Petrúšek, Ivan ; Soukup, Petr (advisor) ; Hendl, Jan (referee)
The objective of this thesis is to evaluate different methods of dealing with missing values in data analysis. The thesis is divided into three major chapters. The first chapter summarizes the theoretical literature on missing data and focuses on missing data mechanisms in particular. The second chapter introduces traditional methods for addressing missing data in sociological research. The third chapter assesses the performance of these methods by analyzing simulated data sets for two variables (income, IQ). For practical analysis (chapter 3), we simulated missing data according to three different mechanisms (MCAR, MAR, NMAR) and varied the proportion of missing values under these mechanisms (10%, 20%, 30%). Then, we applied each of the following four methods of addressing missing values: complete-case analysis, arithmetic mean imputation, regression imputation, and stochastic regression imputation. In order to evaluate the performance of each of these methods we performed correlation and regression analyses for each experimental condition. The results of these simulations are largely in agreement with existing theoretical literature on the subject of missing data. In the case of NMAR, all solution methods provided biased parameter estimates. In the case of MCAR, only complete-case analysis and...
Models and statistical analysis of record processes
Tůmová, Alena ; Volf, Petr (advisor) ; Hlubinka, Daniel (referee)
In this work we model the historical development of best performances in men's 100, 200, 400 and 800m running events. We suppose that the years best performances are independent random variables with generalized extreme value distribution for minima and that there is a decreasing trend in location. Parameters of the models are estimated by using maximum likelihood techniques. The data of years best performances are missing for some years, we treat them as right censored data that are censored by value of world record valid at that time. Graphic tools used for models diagnostics are adjusted to the censoring. The models we get are used to estimate the ultimate records and to predict new records in next years. At the end of the work we estimate several models describing historical development of years best performances for more events at one time.
Problém chybějících dat při sčítání lidu - kteří respondenti neodpověděli
Hora, Jan
The confidentiality of census data is known to be rather restrictive for economic and social research. To improve the availability of census information we have proposed recently a new method of interactive presentation of census results by means of statistical models. The method is based on estimation of the joint probability distribution of data records in the form of a distribution mixture. The estimated mixture model can be used as a knowledge base of a probabilistic expert system and in this way we can derive the statistical information from the distribution mixture without any further access to the original database. The statistical model does not contain the original data and therefore the final interactive software product can be made freely available via internet without any confidentiality concerns.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.